Update Float8Tensor for GRPO training in unsloth by andrewor14 · Pull Request #3158 · pytorch/ao

andrewor14 · 2025-10-12T22:21:23Z

Summary: Support a few extra ops called during GRPO loop in unsloth/vllm for Float8Tensor.

Test Plan:

python test/quantization/quantize_/workflows/float8/test_float8_tensor.py -k test_fp8_matmul_lora_variants
python test/quantization/quantize_/workflows/float8/test_float8_tensor.py -k test_to_dtype_layout
python test/quantization/quantize_/workflows/float8/test_float8_tensor.py -k test_has_compatible_shallow_copy_type
python test/quantization/quantize_/workflows/float8/test_float8_tensor.py -k test_transpose

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update Float8Tensor for GRPO training in unsloth#3158

Update Float8Tensor for GRPO training in unsloth#3158
andrewor14 merged 1 commit intomainfrom
unsloth-fp8-rl-test

andrewor14 commented Oct 12, 2025 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

andrewor14 commented Oct 12, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

andrewor14 commented Oct 12, 2025 •

edited

Loading